(1) GENERAL INFORMATION:

(i) APPLICANT: Kohei MIYAZONO; Takeshi IMAMURA; Peter TEN DIJKE

(ii) TITLE OF INVENTION: ISOLATED ALK-1 PROTEIN, NUCLEIC ACIDS ENCODING 
(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Fulbright & Jaworski L.L.P. 
(D) STATE: New York
(B) COMPUTER: IBM PS/2
(A) APPLICATION NUMBER: 9311047.6
PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: 321344.5 



(B) FILING DATE: October 15, 1993 
(viii) ATTORNEY/AGENT INFORMATION:
(ix) TELECOMMUNICATION INFORMATION:
(2) INFORMATION FOR SEQ ID NO: 1:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1984 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 283..1791

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

AGGAAACGGT TTATTAGGAG GGAGTGGTGG AGCTGGGCCA GGCAGGAAGA CGCTGGAATA 60 

AGAAACATTT TTGCTCCAGC CCCCATCCCA GTCCCGGGAG GCTGCCGCGC CAGCTGCGCC 120 

GAGCGAGCCC CTCCCCGGCT CCAGCCCGGT CCGGGGCCGC GCCGGACCCC AGCCCGCCGT 180 

CCAGCGCTGG CGGTGCAACT GCGGCCGCGC GGTGGAGGGG AGGTGGCCCC GGTCCGCCGA 240

AGGCTAGCGC CCCGCCACCC GCAGAGCGGG CCCAGAGGGA CC ATG ACC TTG GGC 294
Met Thr Leu Gly
1

TCC CCC AGG AAA GGC CTT CTG ATG CTG CTG ATG GCC TTG GTG ACC CAG 342
Ser Pro Arg Lys Gly Leu Leu Met Leu Leu Met Ala Leu Val Thr Gln
5 10 15 20

GGA GAC CCT GTG AAG CCG TCT CGG GGC CCG CTG GTG ACC TGC ACG TGT 390
Gly Asp Pro Val Lys Pro Ser Arg Gly Pro Leu Val Thr Cys Thr Cys
25 30 35

GAG AGC CCA CAT TGC AAG GGG CCT ACC TGC CGG GGG GCC TGG TGC ACA 438
Glu Ser Pro His Cys Lys Gly Pro Thr Cys Arg Gly Ala Trp Cys Thr
40 45 50

GTA GTG CTG GTG CGG GAG GAG GGG AGG CAC CCC CAG GAA CAT CGG GGC 486
Val Val Leu Val Arg Glu Glu Gly Arg His Pro Gln Glu His Arg Gly
55 60 65

TGC GGG AAC TTG CAC AGG GAG CTC TGC AGG GGG CGC CCC ACC GAG TTC 534
Cys Gly Asn Leu His Arg Glu Leu Cys Arg Gly Arg Pro Thr Glu Phe
70 75 80

GTC AAC CAC TAC TGC TGC GAC AGC CAC CTC TGC AAC CAC AAC GTG TCC 582
Val Asn His Tyr Cys Cys Asp Ser His Leu Cys Asn His Asn Val Ser
85 90 95 100

CTG GTG CTG GAG GCC ACC CAA CCT CCT TCG GAG CAG CCG GGA ACA GAT 630
Leu Val Leu Glu Ala Thr Gln Pro Pro Ser Glu Gln Pro Gly Thr Asp
105 110 115

GGC CAG CTG GCC CTG ATC CTG GGC CCC GTG CTG GCC TTG CTG GCC CTG 678
Gly Gln Leu Ala Leu Ile Leu Gly Pro Val Leu Ala Leu Leu Ala Leu
120 125 130

GTG GCC CTG GGT GTC CTG GGC CTG TGG CAT GTC CGA CGG AGG CAG GAG 726
Val Ala Leu Gly Val Leu Gly Leu Trp His Val Arg Arg Arg Gln Glu
135 140 145

AAG CAG CGT GGC CTG CAC AGC GAG CTG GGA GAG TCC AGT CTC ATC CTG 774
Lys Gln Arg Gly Leu His Ser Glu Leu Gly Glu Ser Ser Leu Ile Leu
150 155 160

AAA GCA TCT GAG CAG GGC GAC ACG ATG TTG GGG GAC CTC CTG GAC AGT 822
Lys Ala Ser Glu Gln Gly Asp Thr Met Leu Gly Asp Leu Leu Asp Ser
165 170 175 180

GAC TGC ACC ACA GGG AGT GGC TCA GGG CTC CCC TTC CTG GTG CAG AGG 870
Asp Cys Thr Thr Gly Ser Gly Ser Gly Leu Pro Phe Leu Val Gln Arg
185 190 195

ACA GTG GCA CGG CAG GTT GCC TTG GTG GAG TGT GTG GGA AAA GGC CGC 918
Thr Val Ala Arg Gln Val Ala Leu Val Glu Cys Val Gly Lys Gly Arg
200 205 210

TAT GGC GAA GTG TGG CGG GGC TTG TGG CAC GGT GAG AGT GTG GCC GTC 966
Tyr Gly Glu Val Trp Arg Gly Leu Trp His Gly Glu Ser Val Ala Val
215 220 225

AAG ATC TTC TCC TCG AGG GAT GAA CAG TCC TGG TTC CGG GAG ACT GAG 1014
Lys Ile Phe Ser Ser Arg Asp Glu Gln Ser Trp Phe Arg Glu Thr Glu
230 235 240

ATC TAT AAC ACA GTA TTG CTC AGA CAC GAC AAC ATC CTA GGC TTC ATC 1062
Ile Tyr Asn Thr Val Leu Leu Arg His Asp Asn Ile Leu Gly Phe Ile
245 250 255 260

GCC TCA GAC ATG ACC TCC CGC AAC TCG AGC ACG CAG CTG TGG CTC ATC 1110
Ala Ser Asp Met Thr Ser Arg Asn Ser Ser Thr Gln Leu Trp Leu Ile
265 270 275

ACG CAC TAC CAC GAG CAC GGC TCC CTC TAC GAC TTT CTG CAG AGA CAG 1158
Thr His Tyr His Glu His Gly Ser Leu Tyr Asp Phe Leu Gln Arg Gln
280 285 290

ACG CTG GAG CCC CAT CTG GCT CTG AGG CTA GCT GTG TCC GCG GCA TGC 1206
Thr Leu Glu Pro His Leu Ala Leu Arg Leu Ala Val Ser Ala Ala Cys
295 300 305

GGC CTG GCG CAC CTG CAC GTG GAG ATC TTC GGT ACA CAG GGC AAA CCA 1254
Gly Leu Ala His Leu His Val Glu Ile Phe Gly Thr Gln Gly Lys Pro
310 315 320

GCC ATT GCC CAC CGC GAC TTC AAG AGC CGC AAT GTG CTG GTC AAG AGC 1302
Ala Ile Ala His Arg Asp Phe Lys Ser Arg Asn Val Leu Val Lys Ser
325 330 335 340

AAC CTG CAG TGT TGC ATC GCC GAC CTG GGC CTG GCT GTG ATG CAC TCA 1350
Asn Leu Gln Cys Cys Ile Ala Asp Leu Gly Leu Ala Val Met His Ser
345 350 355

CAG GGC AGC GAT TAC CTG GAC ATC GGC AAC AAC CCG AGA GTG GGC ACC 1398
Gln Gly Ser Asp Tyr Leu Asp Ile Gly Asn Asn Pro Arg Val Gly Thr
360 365 370

AAG CGG TAC ATG GCA CCC GAG GTG CTG GAC GAG CAG ATC CGC ACG GAC 1446
Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu Gln Ile Arg Thr Asp
375 380 385

TGC TTT GAG TCC TAC AAG TGG ACT GAC ATC TGG GCC TTT GGC CTG GTG 1494
Cys Phe Glu Ser Tyr Lys Trp Thr Asp Ile Trp Ala Phe Gly Leu Val
390 395 400

CTG TGG GAG ATT GCC CGC CGG ACC ATC GTG AAT GGC ATC GTG GAG GAC 1542
Leu Trp Glu Ile Ala Arg Arg Thr Ile Val Asn Gly Ile Val Glu Asp
405 410 415 420

TAT AGA CCA CCC TTC TAT GAT GTG GTG CCC AAT GAC CCC AGC TTT GAG 1590
Tyr Arg Pro Pro Phe Tyr Asp Val Val Pro Asn Asp Pro Ser Phe Glu
425 430 435

GAC ATG AAG AAG GTG GTG TGT GTG GAT CAG CAG ACC CCC ACC ATC CCT 1638
Asp Met Lys Lys Val Val Cys Val Asp Gln Gln Thr Pro Thr Ile Pro
440 445 450

AAC CGG CTG GCT GCA GAC CCG GTC CTC TCA GGC CTA GCT CAG ATG ATG 1686
Asn Arg Leu Ala Ala Asp Pro Val Leu Ser Gly Leu Ala Gln Met Met
455 460 465

CGG GAG TGC TGG TAC CCA AAC CCC TCT GCC CGA CTC ACC GCG CTG CGG 1734
Arg Glu Cys Trp Tyr Pro Asn Pro Ser Ala Arg Leu Thr Ala Leu Arg
470 475 480

ATC AAG AAG ACA CTA CAA AAA ATT AGC AAC AGT CCA GAG AAG CCT AAA 1782
Ile Lys Lys Thr Leu Gln Lys Ile Ser Asn Ser Pro Glu Lys Pro Lys
485 490 495 500

GTG ATT CAA TAGCCCAGGA GCACCTGATT CCTTTCTGCC TGCAGGGGGC 1831
Val Ile Gln

TGGGGGGGTG GGGGGCAGTG GATGGTGCCC TATCTGGGTA GAGGTAGTGT GAGTGTGGTG 1891 

TGTGCTGGGG ATGGGCAGCT GCGCCTGCCT GCTCGGCCCC CAGCCCACCC AGCCAAAAAT 1951 

ACAGCTGGGC TGAAACCTGA AAAAAAAAAA AAA 1984 
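
The CDS ranges stated in this listing can be cross-checked against the stated protein lengths. The following is a minimal Python sketch and is not part of the original filing. The names `LISTING`, `cds_codons` and `check_listing` are illustrative assumptions; the numbers are copied from the LENGTH and LOCATION fields of SEQ ID NOS: 1 to 8. It assumes that each CDS range is 1-based and inclusive and that it excludes the stop codon, which agrees with the TAG that directly follows position 1791 above.

```python
# CDS ranges (1-based, inclusive) and protein lengths as stated in the listing.
# Keys name the nucleotide sequence; "protein" names the paired protein sequence.
LISTING = {
    "SEQ ID NO: 1": {"cds": (283, 1791), "protein": "SEQ ID NO: 2", "protein_len": 503},
    "SEQ ID NO: 3": {"cds": (104, 1630), "protein": "SEQ ID NO: 4", "protein_len": 509},
    "SEQ ID NO: 5": {"cds": (310, 1905), "protein": "SEQ ID NO: 6", "protein_len": 532},
    "SEQ ID NO: 7": {"cds": (1, 1515), "protein": "SEQ ID NO: 8", "protein_len": 505},
}


def cds_codons(cds_start, cds_end):
    """Return the number of complete codons in a 1-based inclusive range."""
    length = cds_end - cds_start + 1
    if length <= 0 or length % 3 != 0:
        raise ValueError(f"range {cds_start}..{cds_end} is not a whole number of codons")
    return length // 3


def check_listing(listing):
    """Map each nucleotide sequence name to True if its CDS matches the protein length."""
    return {
        name: cds_codons(*entry["cds"]) == entry["protein_len"]
        for name, entry in listing.items()
    }


if __name__ == "__main__":
    for name, ok in check_listing(LISTING).items():
        entry = LISTING[name]
        start, end = entry["cds"]
        print(f"{name}: CDS {start}..{end} -> {cds_codons(start, end)} codons, "
              f"{entry['protein']} has {entry['protein_len']} residues, match={ok}")
```

A mismatch here would suggest a transcription error in either the LOCATION field or the LENGTH field rather than in the sequence itself.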



(2) INFORMATION FOR SEQ ID NO: 2:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 503 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
(2) INFORMATION FOR SEQ ID NO: 3:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2724 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 104..1630

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CTCCGAGTAC CCCAGTGACC AGAGTGAGAG AAGCTCTGAA CGAGGGCACG CGGCTTGAAG 60 

GACTGTGGGC AGATGTGACC AAGAGCCTGC ATTAAGTTGT ACA ATG GTA GAT GGA 115 

Met Val Asp Gly 
1 

GTG ATG ATT CTT CCT GTG CTT ATC ATG ATT GCT CTC CCC TCC CCT AGT 163
Val Met Ile Leu Pro Val Leu Ile Met Ile Ala Leu Pro Ser Pro Ser
5 10 15 20

ATG GAA GAT GAG AAG CCC AAG GTC AAC CCC AAA CTC TAC ATG TGT GTG 211
Met Glu Asp Glu Lys Pro Lys Val Asn Pro Lys Leu Tyr Met Cys Val
25 30 35

TGT GAA GGT CTC TCC TGC GGT AAT GAG GAC CAC TGT GAA GGC CAG CAG 259
Cys Glu Gly Leu Ser Cys Gly Asn Glu Asp His Cys Glu Gly Gln Gln
40 45 50

TGC TTT TCC TCA CTG AGC ATC AAC GAT GGC TTC CAC GTC TAC CAG AAA 307
Cys Phe Ser Ser Leu Ser Ile Asn Asp Gly Phe His Val Tyr Gln Lys
55 60 65

GGC TGC TTC CAG GTT TAT GAG CAG GGA AAG ATG ACC TGT AAG ACC CCG 355
Gly Cys Phe Gln Val Tyr Glu Gln Gly Lys Met Thr Cys Lys Thr Pro
70 75 80

CCG TCC CCT GGC CAA GCT GTG GAG TGC TGC CAA GGG GAC TGG TGT AAC 403
Pro Ser Pro Gly Gln Ala Val Glu Cys Cys Gln Gly Asp Trp Cys Asn
85 90 95 100

AGG AAC ATC ACG GCC CAG CTG CCC ACT AAA GGA AAA TCC TTC CCT GGA 451
Arg Asn Ile Thr Ala Gln Leu Pro Thr Lys Gly Lys Ser Phe Pro Gly
105 110 115

ACA CAG AAT TTC CAC TTG GAG GTT GGC CTC ATT ATT CTC TCT GTA GTG 499
Thr Gln Asn Phe His Leu Glu Val Gly Leu Ile Ile Leu Ser Val Val
120 125 130

TTC GCA GTA TGT CTT TTA GCC TGC CTG CTG GGA GTT GCT CTC CGA AAA 547 
Phe Ala Val Cys Leu Leu Ala Cys Leu Leu Gly Val Ala Leu Arg Lys 
135 140 145 

TTT AAA AGG CGC AAC CAA GAA CGC CTC AAT CCC CGA GAC GTG GAG TAT 595
Phe Lys Arg Arg Asn Gln Glu Arg Leu Asn Pro Arg Asp Val Glu Tyr
150 155 160

GGC ACT ATC GAA GGG CTC ATC ACC ACC AAT GTT GGA GAC AGC ACT TTA 643
Gly Thr Ile Glu Gly Leu Ile Thr Thr Asn Val Gly Asp Ser Thr Leu
165 170 175 180

GCA GAT TTA TTG GAT CAT TCG TGT ACA TCA GGA AGT GGC TCT GGT CTT 691
Ala Asp Leu Leu Asp His Ser Cys Thr Ser Gly Ser Gly Ser Gly Leu
185 190 195

CCT TTT CTG GTA CAA AGA ACA GTG GCT CGC CAG ATT ACA CTG TTG GAG 739
Pro Phe Leu Val Gln Arg Thr Val Ala Arg Gln Ile Thr Leu Leu Glu
200 205 210

TGT GTC GGG AAA GGC AGG TAT GGT GAG GTG TGG AGG GGC AGC TGG CAA 787
Cys Val Gly Lys Gly Arg Tyr Gly Glu Val Trp Arg Gly Ser Trp Gln
215 220 225

GGG GAA AAT GTT GCC GTG AAG ATC TTC TCC TCC CGT GAT GAG AAG TCA 835
Gly Glu Asn Val Ala Val Lys Ile Phe Ser Ser Arg Asp Glu Lys Ser
230 235 240

TGG TTC AGG GAA ACG GAA TTG TAC AAC ACT GTG ATG CTG AGG CAT GAA 883
Trp Phe Arg Glu Thr Glu Leu Tyr Asn Thr Val Met Leu Arg His Glu
245 250 255 260

AAT ATC TTA GGT TTC ATT GCT TCA GAC ATG ACA TCA AGA CAC TCC AGT 931
Asn Ile Leu Gly Phe Ile Ala Ser Asp Met Thr Ser Arg His Ser Ser
265 270 275

ACC CAG CTG TGG TTA ATT ACA CAT TAT CAT GAA ATG GGA TCG TTG TAC 979
Thr Gln Leu Trp Leu Ile Thr His Tyr His Glu Met Gly Ser Leu Tyr
280 285 290

GAC TAT CTT CAG CTT ACT ACT CTG GAT ACA GTT AGC TGC CTT CGA ATA 1027
Asp Tyr Leu Gln Leu Thr Thr Leu Asp Thr Val Ser Cys Leu Arg Ile
295 300 305



GTG CTG TCC ATA GCT AGT GGT CTT GCA CAT TTG CAC ATA GAG ATA TTT 1075
Val Leu Ser Ile Ala Ser Gly Leu Ala His Leu His Ile Glu Ile Phe
310 315 320

GGG ACC CAA GGG AAA CCA GCC ATT GCC CAT CGA GAT TTA AAG AGC AAA 1123
Gly Thr Gln Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser Lys
325 330 335 340

AAT ATT CTG GTT AAG AAG AAT GGA CAG TGT TGC ATA GCA GAT TTG GGC 1171
Asn Ile Leu Val Lys Lys Asn Gly Gln Cys Cys Ile Ala Asp Leu Gly
345 350 355

CTG GCA GTC ATG CAT TCC CAG AGC ACC AAT CAG CTT GAT GTG GGG AAC 1219
Leu Ala Val Met His Ser Gln Ser Thr Asn Gln Leu Asp Val Gly Asn
360 365 370

AAT CCC CGT GTG GGC ACC AAG CGC TAC ATG GCC CCC GAA GTT CTA GAT 1267
Asn Pro Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp
375 380 385

GAA ACC ATC CAG GTG GAT TGT TTC GAT TCT TAT AAA AGG GTC GAT ATT 1315
Glu Thr Ile Gln Val Asp Cys Phe Asp Ser Tyr Lys Arg Val Asp Ile
390 395 400

TGG GCC TTT GGA CTT GTT TTG TGG GAA GTG GCC AGG CGG ATG GTG AGC 1363
Trp Ala Phe Gly Leu Val Leu Trp Glu Val Ala Arg Arg Met Val Ser
405 410 415 420

AAT GGT ATA GTG GAG GAT TAC AAG CCA CCG TTC TAC GAT GTG GTT CCC 1411
Asn Gly Ile Val Glu Asp Tyr Lys Pro Pro Phe Tyr Asp Val Val Pro
425 430 435

AAT GAC CCA AGT TTT GAA GAT ATG AGG AAG GTA GTC TGT GTG GAT CAA 1459
Asn Asp Pro Ser Phe Glu Asp Met Arg Lys Val Val Cys Val Asp Gln
440 445 450

CAA AGG CCA AAC ATA CCC AAC AGA TGG TTC TCA GAC CCG ACA TTA ACC 1507
Gln Arg Pro Asn Ile Pro Asn Arg Trp Phe Ser Asp Pro Thr Leu Thr
455 460 465

TCT CTG GCC AAG CTA ATG AAA GAA TGC TGG TAT CAA AAT CCA TCC GCA 1555
Ser Leu Ala Lys Leu Met Lys Glu Cys Trp Tyr Gln Asn Pro Ser Ala
470 475 480

AGA CTC ACA GCA CTG CGT ATC AAA AAG ACT TTG ACC AAA ATT GAT AAT 1603
Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr Leu Thr Lys Ile Asp Asn
485 490 495 500

TCC CTC GAC AAA TTG AAA ACT GAC TGT TGACATTTTC ATAGTGTCAA 1650 
Ser Leu Asp Lys Leu Lys Thr Asp Cys 
505 

GAAGGAAGAT TTGACGTTGT TGTCATTGTC CAGCTGGGAC CTAATGCTGG CCTGACTGGT 1710 

TGTCAGAATG GAATCCATCT GTCTCCCTCC CCAAATGGCT GCTTTGACAA GGCAGACGTC 1770 

GTACCCAGCC ATGTGTTGGG GAGACATCAA AACCACCCTA ACCTCGCTCG ATGACTGTGA 1830 

ACTGGGCATT TCACGAACTG TTCACACTGC AGAGACTAAT GTTGGACAGA CACTGTTGCA 1890 




AAGGTAGGGA CTGGAGGAAC ACAGAGAAAT CCTAAAAGAG ATCTGGGCAT TAAGTCAGTG 1950 

GCTTTGCATA GCTTTCACAA GTCTCCTAGA CACTCCCCAC GGGAAACTCA AGGAGGTGGT 2010

GAATTTTTAA TCAGCAATAT TGCCTGTGCT TCTCTTCTTT ATTGCACTAG GAATTCTTTG 2070 

CATTCCTTAC TTGCACTGTT ACTCTTAATT TTAAAGACCC AACTTGCCAA AATGTTGGCT 2130

GCGTACTCCA CTGGTCTGTC TTTGGATAAT AGGAATTCAA TTTGGCAAAA CAAAATGTAA 2190 

TGTCAGACTT TGCTGCATTT TACACATGTG CTGATGTTTA CAATGATGCC GAACATTAGG 2250 

AATTGTTTAT ACACAACTTT GCAAATTATT TATTACTTGT GCACTTAGTA GTTTTTACAA 2310 

AACTGCTTTG TGCATATGTT AAAGCTTATT TTTATGTGGT CTTATGATTT TATTACAGAA 2370 

ATGTTTTTAA CACTATACTC TAAAATGGAC ATTTTCTTTT ATTATCAGTT AAAATCACAT 2430

TTTAAGTGCT TCACATTTGT ATGTGTGTAG ACTGTAACTT TTTTTCAGTT CATATGCAGA 2490 

ACGTATTTAG CCATTACCCA CGTGACACCA CCGAATATAT TATCGATTTA GAAGCAAAGA 2550 

TTTCAGTAGA ATTTTAGTCC TGAACGCTAC GGGGAAAATG CATTTTCTTC AGAATTATCC 2610 

ATTACGTGCA TTTAAACTCT GCCAGAAAAA AATAACTATT TTGTTTTAAT CTACTTTTTG 2670 

TATTTAGTAG TTATTTGTAT AAATTAAATA AACTGTTTTC AAGTCAAAAA AAAA 2724 



(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 509 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Val Asp Gly Val Met Ile Leu Pro Val Leu Ile Met Ile Ala Leu
1 5 10 15

Pro Ser Pro Ser Met Glu Asp Glu Lys Pro Lys Val Asn Pro Lys Leu
20 25 30

Tyr Met Cys Val Cys Glu Gly Leu Ser Cys Gly Asn Glu Asp His Cys
35 40 45

Glu Gly Gln Gln Cys Phe Ser Ser Leu Ser Ile Asn Asp Gly Phe His
50 55 60

Val Tyr Gln Lys Gly Cys Phe Gln Val Tyr Glu Gln Gly Lys Met Thr
65 70 75 80

Cys Lys Thr Pro Pro Ser Pro Gly Gln Ala Val Glu Cys Cys Gln Gly
85 90 95

Asp Trp Cys Asn Arg Asn Ile Thr Ala Gln Leu Pro Thr Lys Gly Lys
100 105 110

Ser Phe Pro Gly Thr Gln Asn Phe His Leu Glu Val Gly Leu Ile Ile
115 120 125

Leu Ser Val Val Phe Ala Val Cys Leu Leu Ala Cys Leu Leu Gly Val
130 135 140

Ala Leu Arg Lys Phe Lys Arg Arg Asn Gln Glu Arg Leu Asn Pro Arg
145 150 155 160

Asp Val Glu Tyr Gly Thr Ile Glu Gly Leu Ile Thr Thr Asn Val Gly
165 170 175

Asp Ser Thr Leu Ala Asp Leu Leu Asp His Ser Cys Thr Ser Gly Ser
180 185 190

Gly Ser Gly Leu Pro Phe Leu Val Gln Arg Thr Val Ala Arg Gln Ile
195 200 205

Thr Leu Leu Glu Cys Val Gly Lys Gly Arg Tyr Gly Glu Val Trp Arg
210 215 220

Gly Ser Trp Gln Gly Glu Asn Val Ala Val Lys Ile Phe Ser Ser Arg
225 230 235 240

Asp Glu Lys Ser Trp Phe Arg Glu Thr Glu Leu Tyr Asn Thr Val Met
245 250 255

Leu Arg His Glu Asn Ile Leu Gly Phe Ile Ala Ser Asp Met Thr Ser
260 265 270

Arg His Ser Ser Thr Gln Leu Trp Leu Ile Thr His Tyr His Glu Met
275 280 285

Gly Ser Leu Tyr Asp Tyr Leu Gln Leu Thr Thr Leu Asp Thr Val Ser
290 295 300

Cys Leu Arg Ile Val Leu Ser Ile Ala Ser Gly Leu Ala His Leu His
305 310 315 320

Ile Glu Ile Phe Gly Thr Gln Gly Lys Pro Ala Ile Ala His Arg Asp
325 330 335

Leu Lys Ser Lys Asn Ile Leu Val Lys Lys Asn Gly Gln Cys Cys Ile
340 345 350

Ala Asp Leu Gly Leu Ala Val Met His Ser Gln Ser Thr Asn Gln Leu
355 360 365

Asp Val Gly Asn Asn Pro Arg Val Gly Thr Lys Arg Tyr Met Ala Pro
370 375 380

Glu Val Leu Asp Glu Thr Ile Gln Val Asp Cys Phe Asp Ser Tyr Lys
385 390 395 400

Arg Val Asp Ile Trp Ala Phe Gly Leu Val Leu Trp Glu Val Ala Arg
405 410 415

Arg Met Val Ser Asn Gly Ile Val Glu Asp Tyr Lys Pro Pro Phe Tyr
420 425 430

Asp Val Val Pro Asn Asp Pro Ser Phe Glu Asp Met Arg Lys Val Val
435 440 445

Cys Val Asp Gln Gln Arg Pro Asn Ile Pro Asn Arg Trp Phe Ser Asp
450 455 460

Pro Thr Leu Thr Ser Leu Ala Lys Leu Met Lys Glu Cys Trp Tyr Gln
465 470 475 480

Asn Pro Ser Ala Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr Leu Thr
485 490 495

Lys Ile Asp Asn Ser Leu Asp Lys Leu Lys Thr Asp Cys
500 505



(2) INFORMATION FOR SEQ ID NO: 5:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2932 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO

(v) FRAGMENT TYPE: internal

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 310..1905

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

GCTCCGCGCC GAGGGCTGGA GGATGCGTTC CCTGGGGTCC GGACTTATGA AAATATGCAT 60 

CAGTTTAATA CTGTCTTGGA ATTCATGAGA TGGAAGCATA GGTCAAAGCT GTTTGGAGAA 120 

AATCAGAAGT ACAGTTTTAT CTAGCCACAT CTTGGAGGAG TCGTAAGAAA GCAGTGGGAG 180

TTGAAGTCAT TGTCAAGTGC TTGCGATCTT TTACAAGAAA ATCTCACTGA ATGATAGTCA 240 

TTTAAATTGG TGAAGTAGCA AGACCAATTA TTAAAGGTGA CAGTACACAG GAAACATTAC 300 

AATTGAACA ATG ACT CAG CTA TAC ATT TAC ATC AGA TTA TTG GGA GCC 348
Met Thr Gln Leu Tyr Ile Tyr Ile Arg Leu Leu Gly Ala
1 5 10

TAT TTG TTC ATC ATT TCT CGT GTT CAA GGA CAG AAT CTG GAT AGT ATG 396
Tyr Leu Phe Ile Ile Ser Arg Val Gln Gly Gln Asn Leu Asp Ser Met
15 20 25

CTT CAT GGC ACT GGG ATG AAA TCA GAC TCC GAC CAG AAA AAG TCA GAA 444
Leu His Gly Thr Gly Met Lys Ser Asp Ser Asp Gln Lys Lys Ser Glu
30 35 40 45

AAT GGA GTA ACC TTA GCA CCA GAG GAT ACC TTG CCT TTT TTA AAG TGC 492
Asn Gly Val Thr Leu Ala Pro Glu Asp Thr Leu Pro Phe Leu Lys Cys
50 55 60

TAT TGC TCA GGG CAC TGT CCA GAT GAT GCT ATT AAT AAC ACA TGC ATA 540
Tyr Cys Ser Gly His Cys Pro Asp Asp Ala Ile Asn Asn Thr Cys Ile
65 70 75

ACT AAT GGA CAT TGC TTT GCC ATC ATA GAA GAA GAT GAC CAG GGA GAA 588
Thr Asn Gly His Cys Phe Ala Ile Ile Glu Glu Asp Asp Gln Gly Glu
80 85 90

ACC ACA TTA GCT TCA GGG TGT ATG AAA TAT GAA GGA TCT GAT TTT CAG 636
Thr Thr Leu Ala Ser Gly Cys Met Lys Tyr Glu Gly Ser Asp Phe Gln
95 100 105

TGC AAA GAT TCT CCA AAA GCC CAG CTA CGC CGG ACA ATA GAA TGT TGT 684
Cys Lys Asp Ser Pro Lys Ala Gln Leu Arg Arg Thr Ile Glu Cys Cys
110 115 120 125

CGG ACC AAT TTA TGT AAC CAG TAT TTG CAA CCC ACA CTG CCC CCT GTT 732
Arg Thr Asn Leu Cys Asn Gln Tyr Leu Gln Pro Thr Leu Pro Pro Val
130 135 140

GTC ATA GGT CCG TTT TTT GAT GGC AGC ATT CGA TGG CTG GTT TTG CTC 780
Val Ile Gly Pro Phe Phe Asp Gly Ser Ile Arg Trp Leu Val Leu Leu
145 150 155

ATT TCT ATG GCT GTC TGC ATA ATT GCT ATG ATC ATC TTC TCC AGC TGC 828
Ile Ser Met Ala Val Cys Ile Ile Ala Met Ile Ile Phe Ser Ser Cys
160 165 170

TTT TGT TAC AAA CAT TAT TGC AAG AGC ATC TCA AGC AGA CGT CGT TAC 876
Phe Cys Tyr Lys His Tyr Cys Lys Ser Ile Ser Ser Arg Arg Arg Tyr
175 180 185

AAT CGT GAT TTG GAA CAG GAT GAA GCA TTT ATT CCA GTT GGA GAA TCA 924
Asn Arg Asp Leu Glu Gln Asp Glu Ala Phe Ile Pro Val Gly Glu Ser
190 195 200 205

CTA AAA GAC CTT ATT GAC CAG TCA CAA AGT TCT GGT AGT GGG TCT GGA 972
Leu Lys Asp Leu Ile Asp Gln Ser Gln Ser Ser Gly Ser Gly Ser Gly
210 215 220

CTA CCT TTA TTG GTT CAG CGA ACT ATT GCC AAA CAG ATT CAG ATG GTC 1020
Leu Pro Leu Leu Val Gln Arg Thr Ile Ala Lys Gln Ile Gln Met Val
225 230 235

CGG CAA GTT GGT AAA GGC CGA TAT GGA GAA GTA TGG ATG GGC AAA TGG 1068
Arg Gln Val Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys Trp
240 245 250

CGT GGC GAA AAA GTG GCG GTG AAA GTA TTC TTT ACC ACT GAA GAA GCC 1116
Arg Gly Glu Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu Ala
255 260 265



AGC TGG TTT CGA GAA ACA GAA ATC TAC CAA ACT GTG CTA ATG CGC CAT 1164
Ser Trp Phe Arg Glu Thr Glu Ile Tyr Gln Thr Val Leu Met Arg His
270 275 280 285

GAA AAC ATA CTT GGT TTC ATA GCG GCA GAC ATT AAA GGT ACA GGT TCC 1212
Glu Asn Ile Leu Gly Phe Ile Ala Ala Asp Ile Lys Gly Thr Gly Ser
290 295 300

TGG ACT CAG CTC TAT TTG ATT ACT GAT TAC CAT GAA AAT GGA TCT CTC 1260
Trp Thr Gln Leu Tyr Leu Ile Thr Asp Tyr His Glu Asn Gly Ser Leu
305 310 315

TAT GAC TTC CTG AAA TGT GCT ACA CTG GAC ACC AGA GCC CTG CTT AAA 1308
Tyr Asp Phe Leu Lys Cys Ala Thr Leu Asp Thr Arg Ala Leu Leu Lys
320 325 330

TTG GCT TAT TCA GCT GCC TGT GGT CTG TGC CAC CTG CAC ACA GAA ATT 1356
Leu Ala Tyr Ser Ala Ala Cys Gly Leu Cys His Leu His Thr Glu Ile
335 340 345

TAT GGC ACC CAA GGA AAG CCC GCA ATT GCT CAT CGA GAC CTA AAG AGC 1404
Tyr Gly Thr Gln Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser
350 355 360 365

AAA AAC ATC CTC ATC AAG AAA AAT GGG AGT TGC TGC ATT GCT GAC CTG 1452
Lys Asn Ile Leu Ile Lys Lys Asn Gly Ser Cys Cys Ile Ala Asp Leu
370 375 380

GGC CTT GCT GTT AAA TTC AAC AGT GAC ACA AAT GAA GTT GAT GTG CCC 1500
Gly Leu Ala Val Lys Phe Asn Ser Asp Thr Asn Glu Val Asp Val Pro
385 390 395

TTG AAT ACC AGG GTG GGC ACC AAA CGC TAC ATG GCT CCC GAA GTG CTG 1548
Leu Asn Thr Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu
400 405 410

GAC GAA AGC CTG AAC AAA AAC CAC TTC CAG CCC TAC ATC ATG GCT GAC 1596
Asp Glu Ser Leu Asn Lys Asn His Phe Gln Pro Tyr Ile Met Ala Asp
415 420 425

ATC TAC AGC TTC GGC CTA ATC ATT TGG GAG ATG GCT CGT CGT TGT ATC 1644
Ile Tyr Ser Phe Gly Leu Ile Ile Trp Glu Met Ala Arg Arg Cys Ile
430 435 440 445

ACA GGA GGG ATC GTG GAA GAA TAC CAA TTG CCA TAT TAC AAC ATG GTA 1692
Thr Gly Gly Ile Val Glu Glu Tyr Gln Leu Pro Tyr Tyr Asn Met Val
450 455 460

CCG AGT GAT CCG TCA TAC GAA GAT ATG CGT GAG GTT GTG TGT GTC AAA 1740
Pro Ser Asp Pro Ser Tyr Glu Asp Met Arg Glu Val Val Cys Val Lys
465 470 475

CGT TTG CGG CCA ATT GTG TCT AAT CGG TGG AAC AGT GAT GAA TGT CTA 1788
Arg Leu Arg Pro Ile Val Ser Asn Arg Trp Asn Ser Asp Glu Cys Leu
480 485 490

CGA GCA GTT TTG AAG CTA ATG TCA GAA TGC TGG GCC CAC AAT CCA GCC 1836
Arg Ala Val Leu Lys Leu Met Ser Glu Cys Trp Ala His Asn Pro Ala
495 500 505

TCC AGA CTC ACA GCA TTG AGA ATT AAG AAG ACG CTT GCC AAG ATG GTT 1884
Ser Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr Leu Ala Lys Met Val
510 515 520 525

GAA TCC CAA GAT GTA AAA ATC TGATGGTTAA ACCATCGGAG GAGAAACTCT 1935
Glu Ser Gln Asp Val Lys Ile
530

AGACTGCAAG AACTGTTTTT ACCCATGGCA TGGGTGGAAT TAGAGTGGAA TAAGGATGTT 1995 

AACTTGGTTC TCAGACTCTT TCTTCACTAC GTGTTCACAG GCTGCTAATA TTAAACCTTT 2055

CAGTACTCTT ATTAGGATAC AAGCTGGGAA CTTCTAAACA CTTCATTCTT TATATATGGA 2115 

CAGCTTTATT TTAAATGTGG TTTTTGATGC CTTTTTTTAA GTGGGTTTTT ATGAACTGCA 2175 

TCAAGACTTC AATCCTGATT AGTGTCTCCA GTCAAGCTCT GGGTACTGAA TTGCCTGTTC 2235

ATAAAACGGT GCTTTCTGTG AAAGCCTTAA GAAGATAAAT GAGCGCAGCA GAGATGGAGA 2295

AATAGACTTT GCCTTTTACC TGAGACATTC AGTTCGTTTG TATTCTACCT TTGTAAAACA 2355 

GCCTATAGAT GATGATGTGT TTGGGATACT GCTTATTTTA TGATAGTTTG TCCTGTGTCC 2415 

TTAGTGATGT GTGTGTGTCT CCATGCACAT GCACGCCGGG ATTCCTCTGC TGCCATTTGA 2475 

ATTAGAAGAA AATAATTTAT ATGCATGCAC AGGAAGATAT TGGTGGCCGG TGGTTTTGTG 2535 

CTTTAAAAAT GCAATATCTG ACCAAGATTC GCCAATCTCA TACAAGCCAT TTACTTTGQA 2595 

AGTGAGATAG CTTCCCCACC AGCTTTATTT TTTAACATGA AAGCTGATGC CAAGGCCAAA 2655 

AGAAGTTTAA AGCATCTGTA AATTTGGACT GTTTTCCTTC AACCACCATT TTTTTTGTGG 2715 

TTATTATTTT TGTCACGGAA AGCATCCTCT CCAAAGTTGG AGCTTCTATT GCCATGAACC 2775 

ATGCTTACAA AGAAAGCACT TCTTATTGAA GTGAATTCCT GCATTTGATA GCAATGTAAG 2835

TGCCTATAAC CATGTTCTAT ATTCTTTATT CTCAGTAACT TTTAAAAGGG AAGTTATTTA 2895 

TATTTTGTGT ATAATGTGCT TTATTTGCAA ATCACCC 2932 



(2) INFORMATION FOR SEQ ID NO: 6:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 532 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

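(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Thr Gln Leu Tyr Ile Tyr Ile Arg Leu Leu Gly Ala Tyr Leu Phe
1 5 10 15

Ile Ile Ser Arg Val Gln Gly Gln Asn Leu Asp Ser Met Leu His Gly
20 25 30

Thr Gly Met Lys Ser Asp Ser Asp Gln Lys Lys Ser Glu Asn Gly Val
35 40 45

Thr Leu Ala Pro Glu Asp Thr Leu Pro Phe Leu Lys Cys Tyr Cys Ser
50 55 60

Gly His Cys Pro Asp Asp Ala Ile Asn Asn Thr Cys Ile Thr Asn Gly
65 70 75 80

His Cys Phe Ala Ile Ile Glu Glu Asp Asp Gln Gly Glu Thr Thr Leu
85 90 95

Ala Ser Gly Cys Met Lys Tyr Glu Gly Ser Asp Phe Gln Cys Lys Asp
100 105 110

Ser Pro Lys Ala Gln Leu Arg Arg Thr Ile Glu Cys Cys Arg Thr Asn
115 120 125

Leu Cys Asn Gln Tyr Leu Gln Pro Thr Leu Pro Pro Val Val Ile Gly
130 135 140

Pro Phe Phe Asp Gly Ser Ile Arg Trp Leu Val Leu Leu Ile Ser Met
145 150 155 160

Ala Val Cys Ile Ile Ala Met Ile Ile Phe Ser Ser Cys Phe Cys Tyr
165 170 175

Lys His Tyr Cys Lys Ser Ile Ser Ser Arg Arg Arg Tyr Asn Arg Asp
180 185 190

Leu Glu Gln Asp Glu Ala Phe Ile Pro Val Gly Glu Ser Leu Lys Asp
195 200 205

Leu Ile Asp Gln Ser Gln Ser Ser Gly Ser Gly Ser Gly Leu Pro Leu
210 215 220

Leu Val Gln Arg Thr Ile Ala Lys Gln Ile Gln Met Val Arg Gln Val
225 230 235 240

Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys Trp Arg Gly Glu
245 250 255

Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu Ala Ser Trp Phe
260 265 270

Arg Glu Thr Glu Ile Tyr Gln Thr Val Leu Met Arg His Glu Asn Ile
275 280 285

Leu Gly Phe Ile Ala Ala Asp Ile Lys Gly Thr Gly Ser Trp Thr Gln
290 295 300

Leu Tyr Leu Ile Thr Asp Tyr His Glu Asn Gly Ser Leu Tyr Asp Phe
305 310 315 320

Leu Lys Cys Ala Thr Leu Asp Thr Arg Ala Leu Leu Lys Leu Ala Tyr
325 330 335

Ser Ala Ala Cys Gly Leu Cys His Leu His Thr Glu Ile Tyr Gly Thr
340 345 350

Gln Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser Lys Asn Ile
355 360 365

Leu Ile Lys Lys Asn Gly Ser Cys Cys Ile Ala Asp Leu Gly Leu Ala
370 375 380

Val Lys Phe Asn Ser Asp Thr Asn Glu Val Asp Val Pro Leu Asn Thr
385 390 395 400

Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu Ser
405 410 415

Leu Asn Lys Asn His Phe Gln Pro Tyr Ile Met Ala Asp Ile Tyr Ser
420 425 430

Phe Gly Leu Ile Ile Trp Glu Met Ala Arg Arg Cys Ile Thr Gly Gly
435 440 445

Ile Val Glu Glu Tyr Gln Leu Pro Tyr Tyr Asn Met Val Pro Ser Asp
450 455 460

Pro Ser Tyr Glu Asp Met Arg Glu Val Val Cys Val Lys Arg Leu Arg
465 470 475 480

Pro Ile Val Ser Asn Arg Trp Asn Ser Asp Glu Cys Leu Arg Ala Val
485 490 495

Leu Lys Leu Met Ser Glu Cys Trp Ala His Asn Pro Ala Ser Arg Leu
500 505 510

Thr Ala Leu Arg Ile Lys Lys Thr Leu Ala Lys Met Val Glu Ser Gln
515 520 525

Asp Val Lys Ile
530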



(2) INFORMATION FOR SEQ ID NO: 7:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2333 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..1515

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



ATG GCG GAG TCG GCC GGA GCC TCC TCC TTC TTC CCC CTT GTT GTC CTC 48
Met Ala Glu Ser Ala Gly Ala Ser Ser Phe Phe Pro Leu Val Val Leu
1 5 10 15

CTG CTC GCC GGC AGC GGC GGG TCC GGG CCC CGG GGG GTC CAG GCT CTG 96
Leu Leu Ala Gly Ser Gly Gly Ser Gly Pro Arg Gly Val Gln Ala Leu
20 25 30

CTG TGT GCG TGC ACC AGC TGC CTC CAG GCC AAC TAC ACG TGT GAG ACA 144
Leu Cys Ala Cys Thr Ser Cys Leu Gln Ala Asn Tyr Thr Cys Glu Thr
35 40 45

GAT GGG GCC TGC ATG GTT TCC TTT TTC AAT CTG GAT GGG ATG GAG CAC 192
Asp Gly Ala Cys Met Val Ser Phe Phe Asn Leu Asp Gly Met Glu His
50 55 60

CAT GTG CGC ACC TGC ATC CCC AAA GTG GAG CTG GTC CCT GCC GGG AAG 240
His Val Arg Thr Cys Ile Pro Lys Val Glu Leu Val Pro Ala Gly Lys
65 70 75 80

CCC TTC TAC TGC CTG AGC TCG GAG GAC CTG CGC AAC ACC CAC TGC TGC 288
Pro Phe Tyr Cys Leu Ser Ser Glu Asp Leu Arg Asn Thr His Cys Cys
85 90 95

TAC ACT GAC TAC TGC AAC AGG ATC GAC TTG AGG GTG CCC AGT GGT CAC 336
Tyr Thr Asp Tyr Cys Asn Arg Ile Asp Leu Arg Val Pro Ser Gly His
100 105 110

CTC AAG GAG CCT GAG CAC CCG TCC ATG TGG GGC CCG GTG GAG CTG GTA 384
Leu Lys Glu Pro Glu His Pro Ser Met Trp Gly Pro Val Glu Leu Val
115 120 125

GGC ATC ATC GCC GGC CCG GTG TTC CTC CTG TTC CTC ATC ATC ATC ATT 432
Gly Ile Ile Ala Gly Pro Val Phe Leu Leu Phe Leu Ile Ile Ile Ile
130 135 140

GTT TTC CTT GTC ATT AAC TAT CAT CAG CGT GTC TAT CAC AAC CGC CAG 480
Val Phe Leu Val Ile Asn Tyr His Gln Arg Val Tyr His Asn Arg Gln
145 150 155 160

AGA CTG GAC ATG GAA GAT CCC TCA TGT GAG ATG TGT CTC TCC AAA GAC 528
Arg Leu Asp Met Glu Asp Pro Ser Cys Glu Met Cys Leu Ser Lys Asp
165 170 175

AAG ACG CTC CAG GAT CTT GTC TAC GAT CTC TCC ACC TCA GGG TCT GGC 576
Lys Thr Leu Gln Asp Leu Val Tyr Asp Leu Ser Thr Ser Gly Ser Gly
180 185 190

TCA GGG TTA CCC CTC TTT GTC CAG CGC ACA GTG GCC CGA ACC ATC GTT 624
Ser Gly Leu Pro Leu Phe Val Gln Arg Thr Val Ala Arg Thr Ile Val
195 200 205

TTA CAA GAG ATT ATT GGC AAG GGT CGG TTT GGG GAA GTA TGG CGG GGC 672
Leu Gln Glu Ile Ile Gly Lys Gly Arg Phe Gly Glu Val Trp Arg Gly
210 215 220



CGC TGG AGG GGT GGT GAT GTG GCT GTG AAA ATA TTC TCT TCT CGT GAA 720
Arg Trp Arg Gly Gly Asp Val Ala Val Lys Ile Phe Ser Ser Arg Glu
225 230 235 240

[nucleotides 721 to 768 and residues 241 to 256 are illegible in the source]

CGC CAT GAA AAC ATC CTT GGA TTT ATT GCT GCT GAC AAT AAA GAT AAT 816
Arg His Glu Asn Ile Leu Gly Phe Ile Ala Ala Asp Asn Lys Asp Asn
260 265 270

GGC ACC TGG ACA CAG CTG TGG CTT GTT TCT GAC TAT CAT GAG CAC GGG 864
Gly Thr Trp Thr Gln Leu Trp Leu Val Ser Asp Tyr His Glu His Gly
275 280 285

TCC CTG TTT GAT TAT CTG AAC CGG TAC ACA GTG ACA ATT GAG GGG ATG 912
Ser Leu Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr Ile Glu Gly Met
290 295 300

ATT AAG CTG GCC TTG TCT GCT GCT AGT GGG CTG GCA CAC CTG CAC ATG 960
Ile Lys Leu Ala Leu Ser Ala Ala Ser Gly Leu Ala His Leu His Met
305 310 315 320

GAG ATC GTG GGC ACC CAA GGG AAG CCT GGA ATT GCT CAT CGA GAC TTA 1008
Glu Ile Val Gly Thr Gln Gly Lys Pro Gly Ile Ala His Arg Asp Leu
325 330 335

AAG TCA AAG AAC ATT CTG GTG AAG AAA AAT GGC ATG TGT GCC ATA GCA 1056
Lys Ser Lys Asn Ile Leu Val Lys Lys Asn Gly Met Cys Ala Ile Ala
340 345 350

GAC CTG GGC CTG GCT GTC CGT CAT GAT GCA GTC ACT GAC ACC ATT GAC 1104
Asp Leu Gly Leu Ala Val Arg His Asp Ala Val Thr Asp Thr Ile Asp
355 360 365

ATT GCC CCG AAT CAG AGG GTG GGG ACC AAA CGA TAC ATG GCC CCT GAA 1152
Ile Ala Pro Asn Gln Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu
370 375 380

GTA CTT GAT GAA ACC ATT AAT ATG AAA CAC TTT GAC TCC TTT AAA TGT 1200
Val Leu Asp Glu Thr Ile Asn Met Lys His Phe Asp Ser Phe Lys Cys
385 390 395 400

GCT GAT ATT TAT GCC CTC GGG CTT GTA TAT TGG GAG ATT GCT CGA AGA 1248
Ala Asp Ile Tyr Ala Leu Gly Leu Val Tyr Trp Glu Ile Ala Arg Arg
405 410 415

TGC AAT TCT GGA GGA GTC CAT GAA GAA TAT CAG CTG CCA TAT TAC GAC 1296
Cys Asn Ser Gly Gly Val His Glu Glu Tyr Gln Leu Pro Tyr Tyr Asp
420 425 430

TTA GTG CCC TCT GAC CCT TCC ATT GAG GAA ATG CGA AAG GTT GTA TGT 1344
Leu Val Pro Ser Asp Pro Ser Ile Glu Glu Met Arg Lys Val Val Cys
435 440 445

GAT CAG AAG CTG CGT CCC AAC ATC CCC AAC TGG TGG CAG AGT TAT GAG 1392
Asp Gln Lys Leu Arg Pro Asn Ile Pro Asn Trp Trp Gln Ser Tyr Glu
450 455 460

GCA CTG CGG GTG ATG GGG AAG ATG ATG CGA GAG TGT TGG TAT GCC AAC 1440
Ala Leu Arg Val Met Gly Lys Met Met Arg Glu Cys Trp Tyr Ala Asn 
465 470 475 480 

GGC GCA GCC CGC CTG ACG GCC CTG CGC ATC AAG AAG ACC CTC TCC CAG 1488 
Gly Ala Ala Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr Leu Ser Gln
485 490 495 

CTC AGC GTG CAG GAA GAC GTG AAG ATC TAACTGCTCC CTCTCTCCAC 1535 
Leu Ser Val Gln Glu Asp Val Lys Ile
500 505 

ACGGAGCTCC TGGCAGCGAG AACTACGCAC AGCTGCCGCG TTGAGCGTAC GATGGAGGCC 1595 

TACCTCTCGT TTCTGCCCAG CCCTCTGTGG CCAGGAGCCC TGGCCCGCAA GAGGGACAGA 1655 

GCCCGGGAGA GACTCGCTCA CTCCCATGTT GGGTTTGAGA CAGACACCTT TTCTATTTAC 1715 

CTCCTAATGG CATGGAGACT CTGAGAGCGA ATTGTGTGGA GAACTCAGTG CCACACCTCG 1775 

AACTGGTTGT AGTGGGAAGT CCCGCGAAAC CCGGTGCATC TGGCACGTGG CCAGGAGCCA 1835 

TGACAGGGGC GCTTGGGAGG GGCCGGAGGA ACCGAGGTGT TGCCAGTGCT AAGCTGCCCT 1895 

GAGGGTTTCC TTCGGGGACC AGCCCACAGC ACACCAAGGT GGCCCGGAAG AACCAGAAGT 1955 

GCAGCCCCTC TCACAGGCAG CTCTGAGCCG CGCTTTCCCC TCCTCCCTGG GATGGACGCT 2015

GCCGGGAGAC TGCCAGTGGA GACGGAATCT GCCGCTTTGT CTGTCCAGCC GTGTGTGCAT 2075

GTGCCGAGGT GCGTCCCCCG TTGTGCCTGG TTCGTGCCAT GCCCTTACAC GTGCGTGTGA 2135 

GTGTGTGTGT GTGTCTGTAG GTGCGCACTT ACCTGCTTGA GCTTTCTGTG CATGTGCAGG 2195 

TCGGGGGTGT GGTCGTCATG CTGTCCGTGC TTGCTGGTGC CTCTTTTCAG TAGTGAGCAG 2255 

CATCTAGTTT CCCTGGTGCC CTTCCCTGGA GGTCTCTCCC TCCCCCAGAG CCCCTCATGC 2315 

CACAGTGGTA CTCTGTGT 2333 
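
The three-letter residues printed beneath each codon row can be checked against the nucleotide row with the standard genetic code. The following is a minimal Python sketch and is not part of the original filing. The names `CODON_TABLE`, `THREE_LETTER` and `translate_row` are illustrative assumptions, and `Ter` is used here as an assumed label for a stop codon. The sample row is the last coding row of SEQ ID NO: 7 above.

```python
from itertools import product

# Standard genetic code, written in the conventional T, C, A, G order.
# product() varies the third base fastest, which matches the order of AMINO.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter residue names; "Ter" marks a stop codon.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}


def translate_row(codon_row):
    """Translate a space-separated row of DNA codons into three-letter residues."""
    residues = []
    for codon in codon_row.upper().split():
        if len(codon) != 3 or codon not in CODON_TABLE:
            raise ValueError(f"not a valid DNA codon: {codon!r}")
        residues.append(THREE_LETTER[CODON_TABLE[codon]])
    return " ".join(residues)


if __name__ == "__main__":
    # Last coding row of SEQ ID NO: 7, followed by the first three bases after it.
    print(translate_row("CTC AGC GTG CAG GAA GAC GTG AAG ATC"))
    print(translate_row("TAA"))
```

A row whose printed residues differ from the output of `translate_row` points to a character-recognition error in either the codon line or the residue line.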



(2) INFORMATION FOR SEQ ID NO: 8:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



Met Ala Glu Ser Ala Gly Ala Ser Ser Phe Phe Pro Leu Val Val Leu
1 5 10 15

Leu Leu Ala Gly Ser Gly Gly Ser Gly Pro Arg Gly Val Gln Ala Leu
20 25 30

Leu Cys Ala Cys Thr Ser Cys Leu Gln Ala Asn Tyr Thr Cys Glu Thr
35 40 45

Asp Gly Ala Cys Met Val Ser Phe Phe Asn Leu Asp Gly Met Glu His
50 55 60

His Val Arg Thr Cys Ile Pro Lys Val Glu Leu Val Pro Ala Gly Lys
65 70 75 80

Pro Phe Tyr Cys Leu Ser Ser Glu Asp Leu Arg Asn Thr His Cys Cys



